Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Autostart Jupyter on node restart #111

Merged

Conversation

sammcveety
Copy link
Contributor

Per #108

For testing this, is the following the right strategy?

  1. Edit jupyter.sh to point at my branch
  2. Push jupyter.sh to GCS
  3. Run gcloud dataproc clusters create

@dennishuo
Copy link
Contributor

Ah sorry, this should be better documented. What you describe should work, but more commonly the outermost jupyter.sh doesn't need to be edited at all, in which case you should just use metadata variables to point at your github fork and branch that you're developing against. For example, in #103 I tested against the pull request's source with:

gcloud dataproc clusters create dhuo-jupyter-fixed \
    --initialization-actions gs://dataproc-initialization-actions/jupyter/jupyter.sh \
    --metadata "INIT_ACTIONS_REPO=https://github.com/bomboradata/dataproc-initialization-actions.git,INIT_ACTIONS_BRANCH=fix-jupyter-5.0.0-config-and-launch"

@sammcveety
Copy link
Contributor Author

Tested with:

gcloud dataproc clusters create sgmc-jupyter
--initialization-actions gs://dataproc-initialization-actions/jupyter/jupyter.sh
--metadata "INIT_ACTIONS_REPO=https://github.com/sammcveety/dataproc-initialization-actions.git,INIT_ACTIONS_BRANCH=add_jupyter_autostart"
--zone us-central1-b

@dennishuo
Copy link
Contributor

Thanks, LGTM!

@dennishuo dennishuo merged commit 3f8d2ad into GoogleCloudDataproc:master May 11, 2017
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants